Picture for Daniel Khashabi

Daniel Khashabi

Johns Hopkins University

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Add code
May 31, 2026
Viaarxiv icon

How to Interpret Agent Behavior

Add code
May 13, 2026
Viaarxiv icon

Many-Tier Instruction Hierarchy in LLM Agents

Add code
Apr 14, 2026
Viaarxiv icon

Steered LLM Activations are Non-Surjective

Add code
Apr 10, 2026
Viaarxiv icon

GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

Add code
Apr 05, 2026
Viaarxiv icon

Are Finer Citations Always Better? Rethinking Granularity for Attributed Generation

Add code
Apr 01, 2026
Viaarxiv icon

CRISP: Characterizing Relative Impact of Scholarly Publications

Add code
Mar 25, 2026
Viaarxiv icon

A Very Big Video Reasoning Suite

Add code
Feb 24, 2026
Viaarxiv icon

Safe and Interpretable Multimodal Path Planning for Multi-Agent Cooperation

Add code
Feb 22, 2026
Viaarxiv icon

Conformal Thinking: Risk Control for Reasoning on a Compute Budget

Add code
Feb 03, 2026
Viaarxiv icon